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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(IKaPPLICANT: Dale. James B. 

(i1) tWlE of INVENTION: GROUP A STREPTOCOCCAL VACCINES 
(111) NUMsk OF SEQUENCES: 16 

(1v) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREHy 6300 Columbia Center. 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: W\sh1ngton 

(E) COUNTRY: 

(F) ZIP: 98104 



(V) COMPUTER READABLE KORM: 

(A) MEDIUM TYPE: FToppy s/isk 

(B) COMPUTER: IBM PC\co/pat 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: Patent I 



YATA: 



)le 
5/MS-dpS 



ase #l\QM(ers1on #1.30 



(v1) CURRENT APPLICATION 

(A) APPLICATION NUMSER^US 

(B) FILING DATE: 12f-SEF-1997 

(C) CLASSIFICATION: 

(vlll) ATTORNEY/AGENT INFORMATION: 

(A) NAME: McMasters , David D. 

(B) REGISTRATION NUMBER: 33.963 

(C) REFERENCE/DOCKET NUMBER: 481112. 

(1x) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 



lOPl 



(2) INFORMATION FOR SEQ ID N0:1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1 
GGGGGGGCAT CGGTCQCGAC TAGGTCTCAG ACAGAT 
(2> INFORMATION FORS^EQ ID NO: 2: 

(i) SEQUENCE CHAR^TERISTICS: 

(A) LENGTH: base pairs 

(B) TYPE: nucldvic acid 

(C) STRANDEDNESS\ single 

(D) TOPOLOGY: linf 
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(xi) SEQUENCE DESCRIPTION: S) 
GGGGGGGGAT CCACGTAGTT TCTCTTTA^ 
(2) INFORMATION FOR SEQ ID NOi 

(i) SEQUENCE CHARACTERISffCfS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
GGGGGGGGAT CCGCCGTGAC TAGGGGTACA 
(2) INFORMATION FOR SEQ ID NQ:4: 



30 
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\) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
^(C) STRANDEDNESS: single 

id) TOPOLOGY: linear 



(xi) SEQUENCe\)ESCRIPTION: SEQ ID N0:4: 
GGGGGGGTCG ACCTCAGTi\tTAACCCTTC 
(2) INFORMATION FOR SEQMD N0:5: 

(i) SEQUENCE CHARACT^ISTICS : 

(A) LENGTH: 30 ba\e pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: s\|ngle 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: S/Q m^O:S: 
GGGGGGGTCG ACAGAGTGTT TCCTAGGGG 
(2) INFORMATION FOR SEQ ID N0:6\: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
GGGGGGCCAT GGTAACTTGT CATTATTAGC 



30 
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(2) INFlSRMATION FOR SEQ ID NO: 7: 

(dVqUENCE CHARACTERISTICS: 
\) LENGTH: 30 base pairs 



(6o TYPE: nucleic acid 
(C\STRANDEDNESS: single 
(D) \0P0L06Y: linear 



(xi) SEQUENCE DESCRJPTION: SEQ ID NO: 7 
GGGGGGCCAT GGAGAGTGCG TTMACTAGG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTER I StVs^ 

(A) LENGTH: 30 base pa^s 

(B) TYPE: nucleic acid/ 

(C) STRANDEDNESS: sin^leN 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
GGGGGGCTGC AGAGATAACT TCTCATTCTG 
(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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()^0 SEQUENCE DESCRIPTION; SEQ ID NO: 9: 
GGGGGGCTQC AGAACGGTGA TGGTAATCCT 
(2) INFORHATION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) iVnGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOKOGY: linear 



(xi) SEQUENCE DESCRIPTICIN : SEQ ID NO: 10: 
GGGGGGGGTA CCAGCTCTCT TAAAATCTCT 
(2) INFORMATION FOR SEQ ID N0:1] 

(i) SEQUENCE CHARACTER I ST l6s :^ 

(A) LENGTH: 30 base /airs' 

(B) TYPE: nucleic adid 

(C) STRANDEDNESS: si|ngl( 

(D) TOPOLOGY: lineat 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGGGGGGGTA CCTTGTTAGA TCAGGTTACA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



30 
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(xdVqUENCE DESCRIPTION: SEQ ID NO: 12: 



GGGGGGATCG toTTTAACT CTTGTAACAG 
(2) INFORMA™ for SEQ ID NO: 13: 

( 1 ) SEOUENCe CHARACTERISTICS : 

(A) LENIsTH: 30 base pairs 

(B) TYPEXnucleic acid 

(C) STRANDEDNESS: single 

(D) T0P0L0g\ linear 



(xi) SEQUENCE DESCRIPTIOnX SEQ ID^ 
GGGGGGATCG ATGTCGCGAC TAGGTCTCAC 
(2) INFORMATION FOR SEQ ID NO: 14/ 

(i) SEQUENCE CHARACTERIST^S: 

(A) LENGTH: 33 base fairs 

(B) TYPE: nucleic aC|id 

(C) STRANDEDNESS: siligle 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGGGGGAAGC TTTTACTTAC GTGCCTCTAA TTC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



33 



31 



(D) TOPOLOGY: linear 



(1x) \EATURE: 



{A) NAME/ KEY: CDS 
. (BO LOCATION: 1. .1149 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 



GCA TGC ATG GTC 
Ala Cys Met Val 
1 

CAA GAA CGT GCT 
Gin Glu Arg Ala 
20 

AAG AAT AGT GAC 
Lys Asn Ser Asp 
35 

GAT GAG TTA ACT 
Asp Glu Leu Thr 
50 

TCC GCC GTG ACT 
Ser Ala Val Thr 
65 

GCT CTT GAC AAG 
Ala Leu Asp Lys 



GCGVACT AGG TCT CAG ACA GAT ACT 
Ala Thr Arg Ser Gin Thr Asp Thr 
5 \ 10 

GAC AAG Wt GAG ATA GAA AAC AAT 
Asp Lys P|\g1u He Glu Asn Asn 
25 

TTA AGT TTT Aa\ AAT AAA GCG TTA 
Leu Ser Phe Asn\sn Lys Ala Leu 
40 



GAA GAG TTG AGT AAT 
Glu Glu Leu Ser Asn 
55 

AGG GGT ACA ATA AAT 
Arg Gly Thr He Asn 

70 



'T AAA SAG 
ita Lys /g\u 

>. / I 

GAC CCS\CAA 
Asp Pr/o &ln 



CTG GAA AAA GTA 
Leu Glu Lys Val 
15 

ACG TTA AAA CTT 
Thr Leu Lys Leu 
30 

AAA GAT CAT AAT 
Lys Asp His Asn 
45 

AAA CTA CGT GGA 
Lys Leu Arg Gly 



GAA GGG TTA AAA 
Glu Gly Leu Lys 
100 

TTA AAA ACT GAG 
Leu Lys Thr Glu 
115 

CCT AGG GGG ACG 
Pro Arg Gly Thr 



TAT GAG CTA GAA AAC CAT GAC TTA 
Tyr Glu Leu Glu Asn His Asp Leu 
85 90 

ACT GAG AAT GAA GGG TTA AAA ACT 
Thr Glu Asn Glu Gly Leu Lys Thr 
105 

AAT GAA GGG TTA AAA ACT GAG GTC 
Asn Glu Gly Leu Lys Thr Glu Val 
120 



AGA GCA AAA GAA 
Arg Ala Lys Glu 
80 

ACT AAG AAT 
Ly^Thr Lys Asn 
95 

GAG AAX GAA GGG 
Glu Asn\lu Gly 
110 

GAC AGA GTGYtT 
Asp Arg Val Pl\e 

125 



GTA GAA AAC CCG GAC AAA GCA CGA 
Val Glu Asn Pro Asp Lys Ala Arg 



GAA CTT CTT AAC 
Glu Leu Leu Asn 



48 



96 



144 



192 



240 



288 



336 



384 



432 
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.130 



135 



140 



AAG TAT GAC GTA GAG AAC TCT ATG TTA CAA GCT AAT AAT GAC AAG TTA 480 
lys TvXasp Val 61 u Asn Sen Met Leu Gin Ala Asn Asn Asp Lys Leu 
US \ 150 155 160 



CCA TGG AGA\GTG CGI TAT ACT AGG CAT ACG CCA GAA GAT AAG CTA AAA 
Pro Trp ArgVal Arg Tyr Thr Arg His Thr Pro Glu Asp Lys Leu Lys 
165 170 175 

AAA ATT ATT GAC GAT CTT GAC GCA AAA GAA CAT GAA TTA CAA CAA CAG 
Lys He He Asp As\Leu Asp Ala Lys Glu His Glu Leu Gin Gin Gin 
180 \ 185 190 

AAT GAG AAG TTA TCT Ct\cAG AAC GGT GAT GGT AAT CCT AGG GAA GTT 
Asn Glu Lys Leu Ser Leu\ln Asn Gly Asp Gly Asn Pro Arg Glu Val 
195 \ 200 205 

ATA GAA GAT CTT GCA GCA AACAAT CCC GCA ATA CAA AAT ATA CGT TTA 
He Glu Asp Leu Ala Ala Asn aVi ProAla He Gin Asn He Arg Leu 
210 215 \ / 220/ 



CGT CAC GAA AAC AAG GAC TTA 
Arg His Glu Asn Lys Asp Leu Ly 
225 230 



GTT GCA GGA AGA GAT TTT AAG A 
Val Ala Gly Arg Asp Phe Lys A|-g 
245 



"GAG AAT GCA ATG GAA 
Arg Leu Glu Asn Ala Met Glu 
235 240 

,A aCT GG1\ACC TTG TTA GAT CAG GTT 
la Gly \ir Leu Leu Asp Gin Val 

255 



250 



ACA CAA TTA TAT ACT AAA CAT AAT AGT AAT TAC CM CAA TAT AAT GCA 
Thr Gin Leu Tyr Thr Lys His Asn Ser Asn Tyr GlrKGln Tyr Asn Ala 
260 265 \ 270 

CAA GCT GGC AGA CTT GAC CTG AGA CAA AAG GCT GAA TATVtA AAA GGC 
Gin Ala Gly Arg Leu Asp Leu Arg Gin Lys Ala Glu Tyr Deu Lys Gly 
275 280 285 

CTT AAT GAT TGG GCT GAG AGG CTG TTA CAA GAG TTA AAT ATC GTC 
Leu Asn Asp Trp Ala Glu Arg Leu Leu Gin Glu Leu Asn He Asfx Val 
290 295 300 

GCG ACT AGG TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GC1 
Ala Thr Arg Ser Gin Thr Asp Thr Leu Glu Lys Val Gin Glu Arg Ala 
305 310 315 320 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 
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6AC TTT GAG ATA GAA AAC AAT ACG TTA AAA CTT AAG AAT AGT 6AC 
Asp lVs Phe Glu He Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp 
325 330 



335 



TTA AGT lYT AAT AAT AAA GCG TTA AAA GAT CAT AAT GAI GAG I lA ACT 
Leu Ser Ph^ Asn Asn Lys Ala Leu Lys Asp His Asn Asp Glu Leu Thr 
\340 345 350 

GAA GAG TTG AG\ AAT GCT AAA GAG AAA CTA CGT AAA AAT GAT AAA TCA 
Glu Glu Leu Ser\sn Ala Lys Glu Lys Leu Arg Lys Asn Asp Lys Ser 
355 \ 360 365 

CTA TCT GAA AAA GCt\t AAA ATT CAA GAA TTA GAG GCA CGT AAG 
Leu Ser Glu Lys Ala SeK Lys He Gin Glu Leu Glu Ala Arg Lys 
370 ^75 380 

TAAAAGCTT 



1008 



1104 



1149 



1158 



(2) INFORMATION FOR SEQ ID NO:: 

(1) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 383 amino'^c 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



ids 



(ii) MOLECULE TYPE: proteii 
(xi) SEQUENCE DESCRIPTION :[ SEQ /D m\6: 

Ala Cys Met Val Ala Thr Arg Ser WThr Asp Tl^i; 
1 5 10 



Leu Glu Lys Val 
15 



Gin Glu Arg Ala Asp Lys Phe Glu He Glu Asn Asn TK^Leu Lys Leu 

20 25 \ 30 

Lvs Asn Ser Asp Leu Ser Phe Asn Asn Lys Ala Leu Lys A^His Asn 
35 40 45 \ 

Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu Af^ Gly 
50 55 60 

Ser Ala Val Thr Arg Gly Thr He Asn Asp Pro Gin Arg Ala Lys g\u 
65 70 75 8f» 
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A1a\Leu Asp Lys Tyr Glu Leu G1u Asn His Asp Leu Lys Thr Lys Asn 
85 90 95 

G lu pii lys Thr Glu Asn Glu G1y Leu Lys Thr G1u As n Glu Gly 



100 



105 



TTO" 



Leu Lys Th\Glu Asn Glu Gly Leu Lys Thr G1u Val Asp Arg Val Phe 
115 \ 120 125 

Pro Arg Gly Thr\a1 Glu Asn Pro Asp Lys Ala Arg Glu Leu Leu Asn 
130 \ 135 140 

Lys Tyr Asp Val Glu\sn Ser Met Leu Gin Ala Asn Asn Asp Lys Leu 
145 l\o 155 160 

Pro Trp Arg Val Arg Tyr\hr Arg His Thr Pro Glu Asp Lys Leu Lys 
165 \ 170 175 

Lys He He Asp Asp Leu AspVla Lys Glu His Glu Leu Gin Gin Gin 
180 \ 185 190 

Asn Glu Lys Leu Ser Leu Gin Asn Xly A^ 
195 200 



He Glu Asp Leu Ala Ala Asn Asn Pi^ 
210 215 



Asn Pro Arg Glu Val 

205 

Gin Asn He Arg Leu 
220 



Arg His Glu Asn Lys Asp Leu Lys 
225 230 

Val Ala Gly Arg Asp Phe Lys Arg 
245 



\la At/g 



^u G>d Asn Ala Met Glu 

240 



Al/Gly Thr 
250 



\eu Leu Asp Gin Val 
255 



Thr Gin Leu Tyr Thr Lys His Asn Ser Asn Tyr Gin Sin Tyr Asn Ala 
260 265 \ 270 

Gin Ala Gly Arg Leu Asp Leu Arg Gin Lys Ala Glu TyrYeu Lys Gly 
275 280 285 

Leu Asn Asp Trp Ala Glu Arg Leu Leu Gin Glu Leu Asn He\Asp Val 
290 295 300 

Ala Thr Arg Ser Gin Thr Asp Thr Leu Glu Lys Val Gin Glu Arg\Ala 
305 310 315 ^0 
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ASD Lys PF^Glu He Glu Asn Asn Thr Leu Lys Leu Lys Asm Ser Asp 
^ 325 330 335 



I PN w Phe Asn AsnH.ys A1a Leu>y^P His Asn Asp Glu Leu Thr 



340 

Glu Glu Leu Ser Asn Ala 
355 



eu Arg Lys Asn Asp Lys Ser 
365 



Leu Ser Glu Lys Ala Ser Ly 
370 37|5 




Glu Ala Arg Lys 



